(57) Abstract

Secretory leader sequences, for use in secreting heterologous polypeptides in yeast, are formed by fusing part of the human
serum albumin pre-sequence or part of the Kluyveromyces lactis killer toxin pre-sequence to the Saccharomyces cerevisiae mating
factor alpha-1 KEX2 cleavage recognition site. The resulting fusion leader sequences are: (a) H2N-Met-Lys-Trp-Val-Ser-Phe-
Ile-Ser-Leu-Leu-Phe-Leu-Phe-Ser-Ser-Ala-Tyr-Ser-Arg-Ser-Leu-Asp-Lys-Arg-COOH or (b) H2N-Met-Asn-Ile-Phe-Tyr-Ile-
Phe-Leu-Phe-Leu-Leu-Ser-Phe-Val-Gln-Gly-Ser-Leu-Asp-Lys-Arg-COOH. Conservative variations are also encompassed.

New secretory leader sequences.

This invention relates to secretory leader sequences 
which can be employed to direct the secretion of a 
heterologous protein (such as human serum albumin) from 
fungi (for example the yeast Saccharomyces cerevisiae).

Translocation of protein molecules through bi-lipid
membranes from one cellular compartment to another
generally relies upon information held within the primary
amino acid sequence of the protein itself. The most
prevalent and therefore the best characterised sequence
information is the amino terminal leader or signal
sequence of prokaryotic and eukaryotic organisms. Genetic
studies in which the signal sequence has been totally or
extensively deleted indicate that the signal sequence is
essential for protein translocation (Benson, S.A. et al.
1985, Ann. Rev. Biochem. 54, 101-134). Among several
hundred known sequences (Watson, M.E.E., 1984, Nuc. Acid.
Res. 12, 5145-5164) no consensus signal sequence or even
an absolute requirement for any amino acid at any given
position can be discerned, although a common feature of
many leader sequences is a core of 7-10 hydrophobic amino
acids. Genetic manipulations which result in alterations
to the hydrophobic core, either by deletion or by
inserting charged residues, generally result in a block
in protein translocation (Benson, S.A., et al. 1985, Ann.
Rev. Biochem. 54, 101-134). Moreover, in a series of
extensive modifications to the chicken lysozyme leader
sequence, Yamamoto et al. 1987 (Biochem. and Biophys.
Res. Comm. 149, 431-436) have shown that, while some
alterations to the hydrophobic core can result in the
abolition of secretion, others can potentiate the leader
sequence function, resulting in increased levels of
protein secretion.

While the leader sequence is usually essential for 
the translocation of proteins across membranes, once 
translocated these sequences are usually endoproteo- 
lytically cleaved by enzymes contained within the 
cellular compartments into which the proteins have now 
moved. These enzymes recognise specific amino acid 
sequences within the primary structure of the 
translocated protein. Moreover, complete processing of 
certain eukaryotic proteins to their mature form often 
relies upon a series of proteolytic cleavages (Bussey, 
H., 1988 Yeast 4, 17-26). 

With the recent advances in recombinant DNA 
technology, increasing resources have been brought to 
bear on the commercial exploitation of fungi, particularly
yeasts, as vehicles for the production of a diverse range
of proteins.

Since many of these proteins are themselves
naturally secreted products, it is possible to utilise 
the information contained within the leader sequence to 
direct the protein through the secretion pathway. 
However, this information is contained within a peptide 
foreign to yeast. Its recognition and subsequent 
processing by the yeast secretory pathway are not 
necessarily as efficient as those of a homologous yeast 
leader sequence. As a consequence an alternative approach 
has been to replace the leader sequence with one derived 
from a naturally secreted yeast protein. 

The most widely used yeast secretory sequence is the
89 amino acid leader sequence of the alpha-factor mating
pheromone. Processing of this leader has been extensively
studied (Kurjan & Herskowitz, Cell 30, 933-943, 1982;
Julius et al. 1983 Cell 32, 839-852; Dmochowska et al.
Cell 50, 573-584, 1987; Julius et al. Cell 36, 309-318,
1984; Julius et al. Cell 37, 1075-1085, 1984) and
requires at least four gene products for complete
proteolytic cleavage to liberate the mature 13 amino acid
alpha-factor pheromone.

Complete proteolytic cleavage of the alpha-factor
primary translation product requires first the removal of
the N-terminal 19 amino acid signal sequence by a signal
peptidase within the endoplasmic reticulum. Following
this the sequential action of three gene products located
within the Golgi apparatus processes the large precursor
molecule, liberating four copies of the alpha-factor
pheromone. These are the KEX2 gene product, an
endopeptidase which cleaves after the Lys-Arg dibasic
amino acid pair; a carboxypeptidase B-like activity,
recently identified as the product of the KEX1 gene; and
a dipeptidyl aminopeptidase, the product of the STE13
gene, which sequentially removes the Glu-Ala or Asp-Ala
diamino acid pairing preceding the mature alpha-factor
pheromone.
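
The order of the Golgi processing steps described above can be illustrated with a toy model in Python. This is a sketch under stated assumptions, not a model of the real precursor: the 13-residue pheromone sequence, the six-residue spacer and the two-repeat test string are assumptions supplied for illustration, and the function names are hypothetical labels for the three activities.

```python
import re

# One-letter amino acid codes. The pheromone sequence and the toy precursor
# are illustrative assumptions and are not taken from the specification.
ALPHA_FACTOR = "WHWLQLKPGQPMY"
SPACER = "KREAEA"  # Lys-Arg followed by two Glu-Ala dipeptides


def kex2_cleave(precursor):
    """Cut after every Lys-Arg pair (KEX2-like endopeptidase step)."""
    return [frag for frag in re.split(r"(?<=KR)", precursor) if frag]


def kex1_trim(fragment):
    """Remove C-terminal Lys/Arg residues (carboxypeptidase B-like step)."""
    return fragment.rstrip("KR")


def ste13_trim(fragment):
    """Remove N-terminal Glu-Ala or Asp-Ala dipeptides one at a time."""
    while fragment[:2] in ("EA", "DA"):
        fragment = fragment[2:]
    return fragment


def process(precursor):
    """Apply the three steps in the order given in the text."""
    trimmed = (ste13_trim(kex1_trim(f)) for f in kex2_cleave(precursor))
    return [f for f in trimmed if f]


toy_precursor = (SPACER + ALPHA_FACTOR) * 2
print(process(toy_precursor))
```

Each repeat in the toy precursor is reduced to the bare pheromone only after all three steps have acted, which is the dependency the paragraph above describes.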

The alpha-factor prepro leader sequence has
successfully been employed to secrete a range of diverse
proteins and peptides. However, when the alpha-factor
signal is used to direct secretion of human serum
albumin, we have found that a large proportion of the
extracellular HSA produced is in the form of a 45 kD
N-terminal fragment.

EP-A-252 561 (Sclavo) discloses the use of the 16
amino acid signal peptide (pre-sequence) from the killer
toxin of Kluyveromyces lactis to aid secretion of
heterologous proteins in yeast.

A further possibility is to use a fusion secretory
leader sequence. This may be generated by the fusion of
two independent sequences. A hybrid signal in which the
first amino acids of the acid phosphatase signal were
fused to the proteolytic cleavage site of human alpha
interferon resulted in the expression and secretion of
interferon (Hinnen et al. Foundation for Biochemical and
Industrial Fermentation Research, 229, 1219-1224, 1983);
10% of the interferon produced was secreted into the
medium. In a similar approach the first 22 amino acids of
the alpha-factor leader were fused to the last twelve
amino acids of the human interferon alpha-2 signal
sequence resulting in the secretion of interferon alpha-2
into the culture supernatant (Piggott et al. Curr. Genet.
12, 561-567, 1987). An identical construct in which the
interferon alpha-2 gene was replaced by the interferon β
gene did not result in any secretion of human interferon
β into the culture supernatant. Finally, in a series of
experiments designed to assess the effect of leader
sequences on the secretion of human lysozyme, Yoshimura
et al. (Biochem. & Biophys. Res. Comm. 145, 712-718,
1987) described a fusion leader comprising the first 9
amino acids of the chicken lysozyme leader and the last 9
amino acids of the Aspergillus awamori glucoamylase
leader. Although this fusion leader was effective in
secreting 60% of the produced material into the culture
supernatant, it was only 15% as effective as the entire
chicken lysozyme leader. Moreover, no secreted product
could be detected if the human lysozyme sequences were
preceded by the entire Aspergillus glucoamylase leader,
or a fusion derived from the first 9 amino acids of the
Aspergillus glucoamylase leader and the last 9 amino
acids of the chicken lysozyme leader.

We have now devised new and advantageous leader 
sequences for use in fungi. 

One aspect of the invention provides an amino acid 
sequence as follows : 

(a) H2N-Met-Lys-Trp-Val-Ser-Phe-Ile-Ser-Leu-Leu-Phe-Leu-
Phe-Ser-Ser-Ala-Tyr-Ser-Arg-Ser-Leu-Asp-Lys-Arg-COOH

or

(b) H2N-Met-Asn-Ile-Phe-Tyr-Ile-Phe-Leu-Phe-Leu-Leu-Ser-
Phe-Val-Gln-Gly-Ser-Leu-Asp-Lys-Arg-COOH

or conservatively modified variations of either sequence. 

Table 1 shows alternative amino acids for each
position except the initial methionine. Any of the
possible permutations are within the scope of the
invention. The selection of lysine or arginine for the
last two positions is particularly non-critical, although
there should always be Lys or Arg at each of these
positions. Preferably, positions 20 and 21 of sequence
(a) are not Gly and Val respectively. Sequences which
are up to four amino acids shorter or longer are also
included provided that the C-terminal (Lys, Arg), Lys-Lys
or Arg-Arg entity is maintained, there is a positively
charged residue within 5 residues of the N-terminus and
there is a generally hydrophobic region at or adjacent
the middle of the sequence.
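
The conditions placed on shorter or longer sequences can be expressed as a simple check. The sketch below is one possible literal reading, not a definition taken from the specification: the hydrophobic residue set, the seven-residue window, the five-hit threshold and the meaning given to "at or adjacent the middle" (a window overlapping the central half) are all assumptions, and the function names are hypothetical.

```python
HYDROPHOBIC = set("AVLIFMW")  # assumed set of hydrophobic residues
BASIC = set("KR")             # Lys and Arg

LEADER_A = "MKWVSFISLLFLFSSAYSRSLDKR"  # sequence (a), one-letter code


def has_hydrophobic_core(seq, window=7, min_hits=5):
    """True if some window overlapping the central half is mostly hydrophobic."""
    n = len(seq)
    lo, hi = n // 4, (3 * n) // 4       # index bounds of the central half
    for start in range(0, n - window + 1):
        end = start + window            # exclusive end of the window
        if end <= lo or start >= hi:
            continue                    # window lies wholly outside the centre
        hits = sum(res in HYDROPHOBIC for res in seq[start:end])
        if hits >= min_hits:
            return True
    return False


def satisfies_length_variant_rules(seq, reference):
    """Check the length limit and the three stated conditions."""
    return (
        abs(len(seq) - len(reference)) <= 4
        and len(seq) >= 2
        and set(seq[-2:]) <= BASIC                # Lys or Arg at both last positions
        and any(res in BASIC for res in seq[:5])  # positive charge near N-terminus
        and has_hydrophobic_core(seq)
    )


shortened = "MKWVSFISLLFLFSSAYSKR"  # hypothetical variant, four residues shorter
print(satisfies_length_variant_rules(LEADER_A, LEADER_A))
print(satisfies_length_variant_rules(shortened, LEADER_A))
```

Under this literal reading only Lys and Arg count as positively charged; sequence (b) as written has neither among its first five residues, so the check is demonstrated here on sequence (a) only.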

Table 1

Leader (a)

  1                                  10
Met Lys Trp Val Ser Phe Ile Ser Leu Leu Phe Leu Phe Ser
    Arg Phe Leu Thr Trp Leu Thr Ile Ile Trp Ile Trp Thr
    His Tyr Ile Gly Tyr Val Gly Val Val Tyr Val Tyr Gly
    Gln     Met Ala     Met Ala Met Met     Met     Ala
    Asn

                     20
Ser Ala Tyr Ser Arg Ser Leu Asp Lys Arg
Thr Thr Phe Thr Lys Thr Ile Glu Arg Lys
Gly Gly Trp Gly His Gly Val Asn
Ala Ser     Ala Gln Ala Met Gln
                Asn         His

Leader (b)

Met Asn Ile Phe Tyr Ile Phe
    Asp Leu Trp Phe Leu Trp
    Glu Val Tyr Trp Val Tyr
    Gln Met         Met
    His

Leu Phe Leu Leu Ser Phe Val
Ile Trp Ile Ile Thr Trp Leu
Val Tyr Val Val Gly Tyr Ile
Met     Met Met Ala     Met

Gln Gly Ser Leu Asp Lys Arg
Asp Ser Thr Ile Asn Arg Lys
Asn Thr Gly Val Glu
Glu Ala Ala Met Gln
His             His
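
Because Table 1 defines a set of allowed residues at each position, membership of a candidate sequence in that set can be tested mechanically. The following Python sketch does this for leader (b). The per-position lists are transcribed from Table 1 under the column alignment assumed here (each alternative is assigned to the chemically similar parent residue above it), and the function names are hypothetical.

```python
from math import prod

# Allowed residues at each position of leader (b); the first entry of each
# tuple is the parent residue. The column assignment is an assumption.
LEADER_B_OPTIONS = [
    ("Met",),
    ("Asn", "Asp", "Glu", "Gln", "His"),
    ("Ile", "Leu", "Val", "Met"),
    ("Phe", "Trp", "Tyr"),
    ("Tyr", "Phe", "Trp"),
    ("Ile", "Leu", "Val", "Met"),
    ("Phe", "Trp", "Tyr"),
    ("Leu", "Ile", "Val", "Met"),
    ("Phe", "Trp", "Tyr"),
    ("Leu", "Ile", "Val", "Met"),
    ("Leu", "Ile", "Val", "Met"),
    ("Ser", "Thr", "Gly", "Ala"),
    ("Phe", "Trp", "Tyr"),
    ("Val", "Leu", "Ile", "Met"),
    ("Gln", "Asp", "Asn", "Glu", "His"),
    ("Gly", "Ser", "Thr", "Ala"),
    ("Ser", "Thr", "Gly", "Ala"),
    ("Leu", "Ile", "Val", "Met"),
    ("Asp", "Asn", "Glu", "Gln", "His"),
    ("Lys", "Arg"),
    ("Arg", "Lys"),
]


def is_table1_variant(sequence):
    """True if a hyphen-separated three-letter sequence fits the table."""
    residues = sequence.split("-")
    return len(residues) == len(LEADER_B_OPTIONS) and all(
        res in allowed for res, allowed in zip(residues, LEADER_B_OPTIONS)
    )


def count_variants():
    """Number of distinct sequences obtainable by free permutation."""
    return prod(len(allowed) for allowed in LEADER_B_OPTIONS)


parent = "-".join(allowed[0] for allowed in LEADER_B_OPTIONS)
print(parent)
print(is_table1_variant(parent))
print(count_variants())
```

The count illustrates how large the family of permitted permutations is; it says nothing about which members would in fact function as leaders.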



A second aspect provides a fusion compound
comprising any of the said amino acid sequences linked,
preferably directly, at the carboxyl terminal to the N-
terminal residue of a polypeptide. The polypeptide may be
any desired polypeptide, including "pro-polypeptides" (in
other words precursors which undergo post-translational
cleavage or other modification, such as glycosylation).
The term "polypeptide" encompasses oligopeptides. The
polypeptide may be fibronectin or a portion thereof (for
example the collagen or fibrin-binding portions described
in EP 207 751), urokinase, pro-urokinase, the 1-368
portion of CD4 (D. Smith et al (1987) Science 238, 1704-
1707), platelet derived growth factor (Collins et al
(1985) Nature 316, 748-750), transforming growth factor β
(Derynck et al (1985) Nature 316, 701-705), the 1-272
portion of Von Willebrand's Factor (Bonthron et al, Nucl.
Acids Res. 14, 7125-7127), the Cathepsin D fragment of
fibronectin (585-1578), α1-antitrypsin, plasminogen
activator inhibitors, factor VIII, α-globin, β-globin,
myoglobin or nerve growth factor or a conservative
variant of any of these. The polypeptide may also be a
fusion of HSA or an N-terminal portion thereof and any
other polypeptide, such as those listed above.
Preferably, the polypeptide is a naturally-occurring
human serum albumin, a modified human serum albumin or a
fragment of either, such modified forms and fragments
being termed "variants". These variants include all forms
or fragments of HSA which fulfill at least one of the
physiological functions of HSA and which are sufficiently
similar to HSA, in terms of structure (particularly
tertiary structure) as to be regarded by the skilled man
as forms or fragments of HSA.



In particular, variants or fragments of HSA which
retain at least 50% (preferably 80%, or 95%) of its
ligand-binding properties, for example with respect to
bilirubin or fatty acids, are encompassed. Such
properties are discussed in Brown, J.R. & Shockley, P.
(1982) in Lipid-Protein Interactions 1, 26-68, Ed. Jost,
P.C. & Griffith, O.H.

The portion of HSA disclosed in EP 322 094 is an
example of a useful fragment of HSA which may be secreted
by use of the leader sequences of the invention.

A third aspect provides a nucleotide sequence coding 
for any of the said amino acid sequences or for the said 
fusion compound. The nucleotide sequence (or the portion 
thereof encoding the leader sequence) may be selected 
from the possibilities shown in Tables 2 & 3, for 
sequences (a) and (b) respectively, where the codons 
encoding each amino acid are listed under the amino 
acids. The codons of Tables 2 and 3 clearly relate to 
RNA, but it is to be understood that equivalent DNA 
nucleotide sequences are also within the scope of this 
aspect of the invention. 
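
The relationship between an amino acid sequence and the set of codons that may encode it can be sketched in Python as follows. The sketch assumes the standard genetic code, written in the conventional U, C, A, G ordering; the function names are hypothetical and the leader is given in one-letter code.

```python
from itertools import product
from math import prod

BASES = "UCAG"
# Standard genetic code in UCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def codons_for(amino_acid):
    """All RNA codons encoding the given one-letter amino acid."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)


def count_encodings(protein):
    """Number of distinct RNA sequences encoding the protein."""
    return prod(len(codons_for(aa)) for aa in protein)


def rna_to_dna(rna):
    """Equivalent DNA coding-strand sequence (U replaced by T)."""
    return rna.replace("U", "T")


LEADER_B = "MNIFYIFLFLLSFVQGSLDKR"  # sequence (b), one-letter code
print(codons_for("D"))
print(count_encodings(LEADER_B))
print(rna_to_dna("AUGAAUAUA"))
```

The last function reflects the statement that equivalent DNA sequences fall within this aspect: the DNA coding strand differs from the RNA codons only in having T in place of U.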

Table 2

Met Lys Trp Val
AUG AAA UGG GUU
    AAG     GUC
            GUA
            GUG

Ser Phe Ile Ser Leu
UCU UUU AUU UCU UUA
UCC UUC AUC UCC UUG
UCA     AUA UCA CUU
UCG         UCG CUC
AGU         AGU CUA
AGC         AGC CUG

Leu Phe Leu Phe Ser
UUA UUU UUA UUU UCU
UUG UUC UUG UUC UCC
CUU     CUU     UCA
CUC     CUC     UCG
CUA     CUA     AGU
CUG     CUG     AGC

Ser Ala Tyr Ser Arg Ser Leu Asp Lys Arg
UCU GCU UAU UCU CGU UCU UUA GAU AAA CGU
UCC GCC UAC UCC CGC UCC UUG GAC AAG CGC
UCA GCA     UCA CGA UCA CUU         CGA
UCG GCG     UCG CGG UCG CUC         CGG
AGU         AGU AGA AGU CUA         AGA
AGC         AGC AGG AGC CUG         AGG

Table 3

Met Asn Ile Phe Tyr Ile Phe
AUG AAU AUU UUU UAU AUU UUU
    AAC AUC UUC UAC AUC UUC
        AUA         AUA

Leu Phe Leu Leu Ser Phe Val
UUA UUU UUA UUA UCU UUU GUU
UUG UUC UUG UUG UCC UUC GUC
CUU     CUU CUU UCA     GUA
CUC     CUC CUC UCG     GUG
CUA     CUA CUA AGU
CUG     CUG CUG AGC

Gln Gly Ser Leu Asp Lys Arg
CAA GGU UCU UUA GAU AAA CGU
CAG GGC UCC UUG GAC AAG CGC
    GGA UCA CUU         CGA
    GGG UCG CUC         CGG
        AGU CUA         AGA
        AGC CUG         AGG



A fourth aspect provides a DNA construct comprising a
suitable control region or regions and a nucleotide
sequence as defined above, the sequence being under the
control of the control region. By "suitable control
region" we mean such DNA regions as are necessary to
enable the said nucleotide sequence to be expressed in
the host for which the construct is intended. The control
region will usually include transcriptional start and
stop sequences, 3'-polyadenylation sequences, a promoter
and, often, an upstream activation site for the promoter.
The man skilled in the art will readily be able to select
and assemble suitable regions from those available in
this art. However, specific examples of suitable
expression vectors and their construction include those
disclosed in EP 198 745, GB 2 171 703 (for B. subtilis),
EP 207 165, EP 115 201, EP 123 244, EP 123 544, EP 147
198, EP 201 239, EP 248 637, EP 251 744, EP 258 067, EP
286 424 and EP 322 094.



A fifth aspect provides a host transformed with the
said DNA construct. The host may be any host in which the
construct is found to work adequately, including
bacteria, yeasts, filamentous fungi, insect cells, plant
cells and animal cells. Preferably, however, the host is
Saccharomyces cerevisiae or Schizosaccharomyces pombe,
most preferably the former. As many native secretion
signals are effective in heterologous hosts (for example
the natural HSA leader sequence in yeast) it is entirely
reasonable to suppose that the leader sequences of the
invention will function in hosts other than yeasts.

A sixth aspect provides a process for preparing a
polypeptide, comprising cultivating the said host and
obtaining therefrom the polypeptide expressed by the said
nucleotide sequence, or a modified version thereof.
By "modified version thereof", we mean that the actual
polypeptide which is separated may have been post-
translationally modified, in particular by cleavage of
the leader sequence.

A seventh aspect provides a polypeptide prepared by 
such a process. 

So that the invention may be more readily 
understood, preferred aspects will now be illustrated by 
way of example and with reference to the accompanying 
drawings in which:

Figure 1 is a restriction map of plasmid pEK113;

Figure 2 is a restriction map of plasmid pEK25;

Figure 3 is a restriction map of plasmid pAYE230;

Figure 4 is a restriction map of plasmid pAYE238;

Figure 5 is a restriction map of plasmid pAYE304;

and

Figure 6 is a restriction map of plasmid pAYE305.

Example of a prior art type of leader sequence

The DNA coding sequence for mature HSA protein has
been placed immediately downstream of a DNA sequence
encoding the KEX2 cleavage site of the alpha-factor
prepro leader sequence (85 amino acids). When this
coding sequence is placed under the control of a promoter
on a yeast autonomously replicating plasmid and transformed
into a haploid strain of the yeast Saccharomyces
cerevisiae, mature HSA can be detected in the culture
supernatant. N-terminal amino acid sequence information
indicates that the secreted protein has the same N-
terminal amino acid composition as natural HSA, namely
Asp-Ala-His. This also indicates that the first two amino
acids of the secreted HSA are not susceptible to the
dipeptidyl aminopeptidase, the product of the STE13 gene,
as this enzyme is responsible for the removal of such
sequences from between successive repeats of the alpha-
factor pheromone. Although mature HSA is the major
product observed in the culture supernatant, an N-terminal
fragment of HSA (45 kilodaltons) was also detected,
representing approximately 15% of the total HSA
synthesised. This fragment component represents not only
a waste of secretion capacity but also poses certain
downstream purification problems in that, as a fragment
of HSA, it shares some biochemical and biophysical
properties with intact HSA.

EXAMPLE 1

We have constructed a fusion leader which may be
regarded as the natural HSA leader sequence from which
the last five amino acids have been removed, to be
replaced by the five amino acids preceding the KEX2
cleavage site of the alpha-factor prepro leader
sequence, i.e. amino acids 81 to 85 are Ser-Leu-Asp-Lys-
Arg (Table 2).

When transformed with suitable plasmid vectors
incorporating the fusion leader, yeast secrete mature HSA
into the culture supernatant at levels comparable to that
observed with the alpha-factor leader sequence. N-
terminal sequence analysis indicates that the mature HSA
possesses the correct N-terminal amino acid composition.

Moreover, substitution of the alpha-factor leader by
the fusion leader sequence has been found to result in a
six-fold reduction in the levels of the 45 kD fragment
observed in the culture supernatant. This therefore
represents a significant improvement in the reduction of
the contaminating polypeptides, thus aiding the
purification of mature HSA from yeast culture
supernatants.

Details

Unless otherwise stated all procedures were carried
out as described by Maniatis et al (1982). Plasmid pEK113
(Figure 1) (EP-A-248 637) was digested to completion with
the restriction endonucleases MstII and HindIII. DNA was
recovered by phenol/chloroform extraction and ethanol
precipitation. The linearised plasmid DNA was then
treated with the Klenow fragment of E. coli DNA polymerase
I to generate a linearised DNA molecule with blunt ends.

The following oligonucleotide duplex (I) was
constructed on an automated Applied Biosystems Inc 380B
DNA synthesiser (according to manufacturer's
instructions).

Oligonucleotide I

5'                           3'
GGC TTA TAA GGA TCC TTA TAA GCC
CCG AAT ATT CCT AGG AAT ATT CGG
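
Two properties of duplex I can be confirmed directly from the printed sequence: the lower strand is the complement of the upper strand, and the duplex reads the same on both strands. A minimal Python sketch follows. The helper names are hypothetical, and the remark on the GGATCC site relies on the well-known BamHI recognition sequence rather than on any statement in the specification about why the duplex was designed this way.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Sequence of the opposite strand, read 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def strands_pair(top_5to3, bottom_3to5):
    """True if the lower strand, as printed 3' to 5', pairs base for base."""
    return top_5to3.translate(COMPLEMENT) == bottom_3to5


top = "GGC TTA TAA GGA TCC TTA TAA GCC".replace(" ", "")
bottom = "CCG AAT ATT CCT AGG AAT ATT CGG".replace(" ", "")

print(strands_pair(top, bottom))        # lower strand complements the upper
print(reverse_complement(top) == top)   # duplex is self-complementary
print("GGATCC" in top)                  # contains a BamHI recognition site
print("TAA" in top)                     # contains a TAA triplet
```

Because the duplex is self-complementary, it presents the same sequence whichever way round it is ligated into the blunt-ended plasmid.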

The oligonucleotide duplex was ligated with
equimolar quantities of linearised, blunt-ended pEK113.
E. coli strain MC1061 was transformed with the ligation
mixture and cells receiving DNA were selected on an
ampicillin-containing medium (50 µg/ml ampicillin).
Recombinant plasmids containing the oligonucleotide
duplex were screened by digesting DNA prepared from
individual colonies with the restriction endonucleases
MstII and EcoRI. Plasmid pEK25 was thus formed (Figure
2).

Plasmid pEK25 was digested to completion with the
restriction endonucleases XbaI and BamHI. DNA fragments
were separated by electrophoresis through a 1% (w/v)
agarose gel and a 688 base pair XbaI-BamHI DNA fragment
recovered from the gel by electroelution.

The plasmid mpl9.7 (EP-A-248 637) was digested to
completion with the restriction endonuclease XhoI. The
linearised DNA was phenol/chloroform extracted and
ethanol precipitated. The recovered DNA was then treated
with the Klenow fragment of E. coli DNA polymerase I as
previously described, following which the DNA was
phenol/chloroform extracted and ethanol precipitated. The
recovered DNA was then digested to completion with XbaI
and the digestion products separated by agarose gel
electrophoresis. A 1067 base pair fragment was recovered
from the gel by electroelution. The following oligo-
nucleotide duplex (II) was prepared as described
previously.



WO 90/01063    PCT/GB89/00816

Oligonucleotide II

5'
GATCC ATG AAG TGG GTA AGC TTT ATT TCC CTT CTT TTT CTC
      TAC TTC ACC CAT TCG AAA TAA AGG GAA GAA AAA GAG

                                                   3'
      TTT AGC TCG GCT TAT TCC AGG AGC TTG GAT AAA AGA
      AAA TCG AGC CGA ATA AGG TCC TCG AAC CTA TTT TCT
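
That the upper strand of duplex II encodes fusion leader (a) can be confirmed by translating it. The Python sketch below assumes the standard genetic code and treats the leading GATCC as a non-coding cohesive end; the function name is hypothetical and the result is given in one-letter code.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

OLIGO_II_TOP = (
    "GATCC ATG AAG TGG GTA AGC TTT ATT TCC CTT CTT TTT CTC "
    "TTT AGC TCG GCT TAT TCC AGG AGC TTG GAT AAA AGA"
)


def translate(dna):
    """Translate a DNA coding strand, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


coding = OLIGO_II_TOP.replace(" ", "")[5:]  # drop the GATCC overhang
print(translate(coding))
```

The translation corresponds residue for residue to Met-Lys-Trp-Val-Ser-Phe-Ile-Ser-Leu-Leu-Phe-Leu-Phe-Ser-Ser-Ala-Tyr-Ser-Arg-Ser-Leu-Asp-Lys-Arg.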

The plasmid pUC19 (Yanisch-Perron et al. 1985) was
digested to completion with the restriction endonuclease
BamHI. Linearised DNA was recovered by phenol/chloroform
extraction and ethanol precipitation.

Equimolar quantities of the BamHI digested pUC19,
the oligonucleotide duplex II, the 1067 b.p. DNA fragment
derived from mpl9.7 and the 688 b.p. DNA fragment derived
from pEK25 were ligated together. E. coli DH5 was trans-
formed with the ligated DNA and transformants selected on
50 µg/ml ampicillin L-broth agar. Recombinant colonies
containing the desired plasmid, designated pAYE230
(Figure 3), were selected by digesting DNA obtained from
individual colonies with the restriction endonuclease
BamHI.

Plasmid pAYE230 was digested to completion with
BamHI and the products separated by electrophoresis
through a 1% agarose gel. The 1832 base pair fragment
containing the HSA coding sequence was recovered by
electroelution.

Plasmid pMA91 (Mellor et al. 1983) was digested to
completion with BglII under standard conditions. The
linearised plasmid was phenol/chloroform extracted and
ethanol precipitated.

Equivalent quantities of the linearised pMA91 and
the DNA fragment prepared from pAYE230 were ligated
under standard conditions. E. coli DH5 was transformed
with the ligation mixture and cells receiving the DNA
selected on L-broth agar containing 50 µg/ml ampicillin.
Colonies containing the desired plasmid, designated pAYE238
(Figure 4), were selected by digesting the DNA from
such colonies with PvuII.

Plasmid pAYE238 was transformed into the yeast
Saccharomyces cerevisiae strain S150-2B as described by
Hinnen et al. (1978). Cells receiving plasmid pAYE238
were selected on minimal medium, supplemented with 2%
(w/v) glucose, 20mg/l histidine, 20mg/l tryptophan and
20mg/l uracil.

Transformed S150-2B cells were transferred to 10ml
YEPD medium containing 2% (w/v) glucose and incubated at
30°C, 200rpm for 72 hours. Cell free culture supernatants
were analysed by discontinuous native 8-25% gradient
polyacrylamide gel electrophoresis on a Pharmacia Phast
System, as described in the manufacturer's instructions.
Gels were stained and destained and the relative
quantities of native HSA and HSA fragment estimated by
gel scan at 595nm.
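
The relative quantities obtained from a gel scan reduce to simple ratios of band signal. The Python sketch below shows one way such a comparison might be computed. The peak areas are hypothetical values chosen only to illustrate the arithmetic, the function names are not taken from any instrument software, and comparing the fragment's share of total signal between lanes is an assumed reading of how the reduction is expressed.

```python
def fragment_fraction(intact_signal, fragment_signal):
    """Share of the total HSA signal found in the fragment band."""
    total = intact_signal + fragment_signal
    if total <= 0:
        raise ValueError("band signals must sum to a positive value")
    return fragment_signal / total


def fold_reduction(reference_fraction, test_fraction):
    """How many times smaller the fragment share is than in the reference."""
    return reference_fraction / test_fraction


# Hypothetical integrated peak areas (arbitrary units) for two lanes.
alpha_factor_lane = {"intact": 85.0, "fragment": 15.0}
fusion_leader_lane = {"intact": 97.5, "fragment": 2.5}

reference = fragment_fraction(alpha_factor_lane["intact"],
                              alpha_factor_lane["fragment"])
test = fragment_fraction(fusion_leader_lane["intact"],
                         fusion_leader_lane["fragment"])
print(round(reference, 3), round(test, 3), round(fold_reduction(reference, test), 1))
```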

EXAMPLE 2 

We have also constructed a second fusion leader which
consists of the 16 amino acid pre region of the 97,000
dalton Kluyveromyces lactis killer (ORF 2) toxin (Stark
and Boyd, 1986; Tokunaga et al 1987) fused to the five
amino acids preceding the KEX2 cleavage site of the
alpha-factor prepro leader sequence, i.e. amino acids 81
to 85, Ser-Leu-Asp-Lys-Arg (Table 3).

When transformed with plasmid vectors incorporating the
fusion leader described in Table 3, yeast secreted mature
HSA into the culture supernatants at levels higher than
when either the natural K. lactis prepro killer toxin
leader sequence or the alpha-factor prepro leader
sequence was used. N-terminal sequence analysis
indicates that the mature HSA possesses the correct N-
terminal amino acid composition.

Substitution of the alpha-factor leader by the K. lactis
killer/alpha-factor fusion leader sequence resulted in a
six-fold reduction in the levels of the 45 kD fragment
observed in the culture supernatant. This therefore
represents a significant improvement in the reduction of
the contaminating polypeptides, thus aiding the
purification of mature HSA from yeast culture
supernatants.

Details 

The experimental procedures employed to generate a yeast 
HSA secretion vector utilising the K. lactis killer/alpha 
factor fusion leader were identical to those described in 
Example 1, except that oligonucleotide duplex (II) was 
replaced by oligonucleotide duplex (III) synthesised on 
an automated Applied Biosystems Inc. 380B DNA synthesiser 
(according to manufacturer's instructions). 

Oligonucleotide duplex III

GATCC ATG AAT ATA TTT TAC ATA TTT TTG TTT TTG CTG TCA TTC
      TAC TTA TAT AAA ATG TAT AAA AAC AAA AAC GAC AGT AAG

      GTT CAA GGA AGC TTG GAT AAA AGA
      CAA GTT CCT TCG AAC CTA TTT TCT
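
A codon-by-codon comparison of the two strands is a convenient consistency check on a duplex of this kind, since a single misread base in a tyrosine codon can turn it into a stop codon. The Python sketch below reports any codon position at which the strands fail to pair and any in-frame stop codon in the upper strand. The fifth codon is taken as TAC (Tyr), the reading consistent with the ATG printed beneath it; the function names are hypothetical.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}
STOP_CODONS = {"TAA", "TAG", "TGA"}

# Upper strand 5' to 3', without the GATCC overhang; fifth codon read as TAC.
TOP = ("ATG AAT ATA TTT TAC ATA TTT TTG TTT TTG CTG TCA TTC "
       "GTT CAA GGA AGC TTG GAT AAA AGA").split()
# Lower strand as printed beneath it, 3' to 5'.
BOTTOM = ("TAC TTA TAT AAA ATG TAT AAA AAC AAA AAC GAC AGT AAG "
          "CAA GTT CCT TCG AAC CTA TTT TCT").split()


def mismatched_codons(top, bottom):
    """1-based codon positions where the strands are not complementary."""
    return [
        i for i, (t, b) in enumerate(zip(top, bottom), start=1)
        if "".join(COMPLEMENT[base] for base in t) != b
    ]


def in_frame_stops(top):
    """1-based positions of stop codons in the upper strand."""
    return [i for i, codon in enumerate(top, start=1) if codon in STOP_CODONS]


print(mismatched_codons(TOP, BOTTOM))
print(in_frame_stops(TOP))
```

Both lists are empty for the duplex as given, and the 21 codons match the 21 residues of leader (b).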

Equimolar quantities of the BamHI digested pUC19, the
oligonucleotide duplex III, the 1067 b.p. DNA fragment
derived from mpl9.7 and the 688 b.p. DNA fragment derived
from pEK25 were ligated together. E. coli DH5 was
transformed with ligated DNA and transformants selected
on 50 µg/ml ampicillin L-broth agar. Recombinant colonies
containing the desired plasmid, designated pAYE304
(Figure 5), were selected by digesting DNA obtained from
individual colonies with the restriction endonuclease
BamHI.



Plasmid pAYE304 was digested to completion with BamHI and
the products separated by electrophoresis through a 1% 
agarose gel. The 1823 base pair fragment containing the 
HSA coding sequence was recovered by electroelution. 
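
The sizes of the BamHI fragments recovered in Examples 1 and 2 are consistent with simple additive bookkeeping of the ligated pieces. The Python sketch below assumes, for illustration only, that the fragment length is the sum of the five-base GATCC end, the leader codons, the 1067 base pair fragment and the 688 base pair fragment, with nothing gained or lost at the junctions; the constant and function names are hypothetical.

```python
FRAGMENT_FROM_MPL97_BP = 1067  # fragment recovered from mpl9.7
FRAGMENT_FROM_PEK25_BP = 688   # XbaI-BamHI fragment recovered from pEK25
OVERHANG_BP = 5                # GATCC at the start of each duplex


def expected_fragment_bp(leader_codons):
    """Additive estimate of the BamHI fragment length in base pairs."""
    return (OVERHANG_BP + 3 * leader_codons
            + FRAGMENT_FROM_MPL97_BP + FRAGMENT_FROM_PEK25_BP)


print(expected_fragment_bp(24))  # leader (a), 24 codons
print(expected_fragment_bp(21))  # leader (b), 21 codons
```

On this reckoning the two fragments differ by nine base pairs, which corresponds to the three-codon difference in length between leaders (a) and (b).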

Plasmid pMA91 (Mellor et al. 1983) was digested to
completion with BglII under standard conditions. The
linearised plasmid was phenol/chloroform extracted and
ethanol precipitated.

Equivalent quantities of the linearised pMA91 and the DNA
fragment prepared from pAYE304 were ligated under
standard conditions. E. coli DH5 was transformed with the
ligation mixture and cells receiving DNA selected on L-
broth agar containing 50 µg/ml ampicillin. Colonies
containing the desired plasmid, designated pAYE305
(Figure 6), were selected by digesting the DNA from such
colonies with PvuII.

Plasmid pAYE305 was transformed into the yeast
Saccharomyces cerevisiae strain S150-2B as described by
Hinnen et al. (1978). Cells receiving plasmid pAYE305
were selected on minimal medium, supplemented with 2%
(w/v) glucose, 20mg/l histidine, 20mg/l tryptophan and
20mg/l uracil.

Transformed S150-2B cells were transferred to 10ml YEPD 
medium containing 2% (w/v) glucose and incubated at 30°C, 
200rpm for 72 hours. Cell free culture supernatants were 
analysed by discontinuous native 8-25% gradient 
polyacrylamide gel electrophoresis on a Pharmacia Phast 
System, as described in the manufacturer's instructions.
Gels were stained and destained and the relative
quantities of native HSA and HSA fragment estimated by
gel scan at 595nm.

EXAMPLE 3 

Using a vector based on the disintegration vectors of
EP 286 424 (Delta Biotechnology), a suitable promoter and
the fusion leader of Example 1 above, Schizosaccharomyces
pombe (strain Leu1.32h) was transformed and fermented at
30°C in 10ml of EMM (Edinburgh minimal medium, Ogden,
J.E. & Fantes, P.A. (1986) Curr. Genetics 10, 509-514),
buffered to pH 5.6 with 0.1M citric acid/sodium
phosphate, to give 10-15 mg/l of HSA in the culture
supernatant after 3 days.

CLAIMS

1. An amino acid sequence as follows:

(a) H2N-Met-Lys-Trp-Val-Ser-Phe-Ile-Ser-Leu-Leu-Phe-Leu-
Phe-Ser-Ser-Ala-Tyr-Ser-Arg-Ser-Leu-Asp-Lys-Arg-COOH

or

(b) H2N-Met-Asn-Ile-Phe-Tyr-Ile-Phe-Leu-Phe-Leu-Leu-Ser-
Phe-Val-Gln-Gly-Ser-Leu-Asp-Lys-Arg-COOH

or conservatively modified variants of either sequence.

2. A fusion compound comprising an amino acid sequence
according to Claim 1 linked at the carboxyl terminal to
the N-terminal residue of a polypeptide.

3. A fusion compound according to Claim 2 wherein the
said amino acid sequence is linked directly to said
polypeptide.

4. A fusion compound according to Claim 3 wherein the
polypeptide is a naturally-occurring human serum albumin,
a modified human serum albumin or a fragment of either.

5. A nucleotide sequence coding for the amino acid
sequence of Claim 1 or for a fusion compound according to
Claim 2.

6. A nucleotide sequence according to Claim 5 selected
from the sequences shown in Table 2 or 3.

7. A DNA construct comprising a suitable control region
or regions and a nucleotide sequence according to Claim 5
or 6, the sequence being under the control of the control
region.

8. A host transformed with a DNA construct according to
Claim 7.

9. Saccharomyces cerevisiae or Schizosaccharomyces
pombe according to Claim 8.

10. A process for preparing a polypeptide, comprising
cultivating a host according to Claim 8 or 9 and
obtaining therefrom the polypeptide expressed by the said
nucleotide sequence or a modified version thereof.

11. A polypeptide prepared by a process according to
Claim 10.

12. Human serum albumin or a variant thereof prepared by
a process according to Claim 10.